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Abstract 

This article considers a model of genealogy corresponding to a regular ex- 
changeable coalescent (also known as H-coalescent) started from a large 
finite configuration, and undergoing neutral mutations. Asymptotic ex- 
pressions for the number of active lineages were obtained by the author 
in a previous work. Analogous results for the number of active mutation- 
free lineages and the combined lineage lengths are derived using the same 
martingale-based technique. They are given in terms of convergence in 
probability, while extensions to convergence in moments and convergence 
almost surely are discussed. The above mentioned results have direct con- 
sequences on the sampling theory in the H-coalescent setting. In particular, 
the regular H-coalescents that come down from infinity (i.e., with locally 
finite genealogies) have an asymptotically equal number of families under 
the corresponding infinite alleles and infinite sites models. In special cases, 
quantitative asymptotic formulae for the number of families that contain a 
fixed number of individuals can be given. 
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1 Introduction 



Kingman's coalescent [T5| [16] is one of the central models of mathematical popu- 
lation genetics. From the theoretical perspective, its importance is linked to the 
duality with the Fisher- Wright diffusion (and more generally with the Fleming- 
Viot process). Therefore the Kingman coalescent emerges in the scaling limit of 
genealogies of all evolutionary models that are asymptotically linked to Fisher- 
Wright diffusions. From the practical perspective, its elementary nature allows 
for exact computations and fast simulation, making it amenable to statistical 
analysis. 

Assume that the original sample has n individuals, labeled {1, 2, . . . , n}. The 
genealogy is a process in which ancestral lineages coalesce in continuous time. 
One can identify the original sample with the trivial partition {{1}, . . . , {n}}, 
and moreover, at any positive time, one can identify each of the active ancestral 
lineages with a unique equivalence class of {1, 2, . . . , n} that consists of the labels 
of all the individuals that descend from this lineage. In this way, each coalescent 
event of two (or more) ancestral lineages can be perceived as the merging of the 
corresponding equivalence classes. Ignoring the partition structure information, 
one can now view the coalescent as a block (rather than equivalence class) merging 
process. 

Kingman's coalescent corresponds to the dynamics where each pair of blocks 
coalesces at rate 1. In particular, while there are n blocks present in the config- 
uration, the total number of blocks decreases by 1 at rate (™) . In 1972 Ewens 
[T2] derived a sampling formula that holds for several neutral population evolu- 
tion models, and in particular for the Kingman coalescent. Its importance is well 
indicated by almost 900 scientific citations in less than four decades^- The Ewens 
sampling formula will be recalled in Section 13.11 

The fact that in the Kingman coalescent dynamics only pairs of blocks can 
merge at any given time makes it less suitable to model evolution of marine pop- 
ulations or viral populations under strong selection. In fact, it is believed (and 
argued to be observed in experiments, see e.g. [17]) that in such settings the re- 
production mechanism allows for a proportion of the population to have the same 
parent (i.e., first generation ancestor). This translates to having multiple collisions 
of the ancestral lineages in the corresponding coalescent mechanism. 

A family of mathematical models with the above property was independently 
introduced and studied by Pitman [23] and Sagitov [21] (see also the remark on 
page 195 of Donnelly and Kurtz [8]) under the name A-coalescents or coalescents 
with multiple collisions. Almost immediately emerged an even more general class 

1 source: ISI Web of Knowledge http : / / apps . isiknowledge . com/ 
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of models, named "E-coale scents or coalescents with simultaneous multiple collisions 
or exchangeable coalescents. The S-coalescent processes were initially studied by 
Mohle and Sagitov [2ij, and introduced by Schweinsberg [25] in their full gen- 
erality. In particular, it is shown in [21J that any limit of genealogies arising 
from a population genetics model with (time-homogeneous) exchangeable repro- 
duction mechanism must be a S-coalescent. For additional pointers to the recent 
coalescent literature see [21 E] - 

The present article can be considered as a sequel to Berestycki et al. [3] and 
Limic [IB]. It demonstrates once again the power of martingale techniques in the 
study of exchangeable coalescents. In fact, the main result is proved by a variation 
of the technique from [H [IB] • The primary interest of the current work however 
is its potential for applications in studies of sampling statistics. 

Unlike [31 [TBI H] , the present work does not focus primarily on the coalescents 
that "come down from infinity". Indeed, the starting configuration is finite in 
the current context, so as long as a certain regularity condition (see [IB] or (jRj) 
at the beginning of Section [3]) holds, the small-time asymptotic results derived 
here (in Theorem []] and Proposition [2]) apply. All the A-coalescents (as well as 
certain general exchangeable coalescents of particular interest in mathematical 
population genetics [HI 127] ) are regular in this sense (see also Remark 12 in 
[IB]). If a regular S-coalescent has a locally finite genealogy (or equivalently, if its 
standard version comes down from infinity), its small-time asymptotics determines 
the asymptotic growth of the corresponding combined lineage length, and in turn, 
the asymptotic number of families in the infinite alleles (resp. sites) model (see 
Theorem [3]). It is worth pointing out that Proposition [2] is the first example of a 
"meta-theorem" that applies in the more general context of regular exchangeable 
coalescents that have (potentially random but) uniformly bounded rate (per unit 
lineage length) of mutation, recombination and/or migration (see also Remark 
[6]). Some consequences on the asymptotic frequency spectrum are discussed in 
Section 13 . 11 

This paper inherits the basic notation from [IB] . For the benefit of the reader 
we recall it here. The set of real numbers is denoted by R and (0, oo) by R + . For 
a, b G R, denote by a A b (resp. a V b) the minimum (resp. maximum) of the two 
numbers. Let 

A := {(xi,x 2 ,...,) : x x > x 2 > . . . > 0, < 1}, (1) 

i 

be the infinite unit simplex. For x = (xi, x 2 , ■ ■ •) G A and c G R, let 

CX = (cXi, CX2, • • •)• 
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The notation log is reserved for the natural logarithm, that is, the inverse of 
R 3 x e x G (0, oo). If / is a function, defined in a left-neighborhood (s — e, s) of 
a point s, denote by f(s-) the left limit of / at s. Given two functions /, g : 1R + — > 
R+, write / = 0(g) if limsup f(x)/g(x) < oo, / = o(g) if limsup f(x)/g(x) = 0, 
and / ~ g if lim/(x)/^(x) = 1. Furthermore, write / = 0(g) if both / = 0(g) 
and g = 0(f). The point at which the limits are taken is determined from the 
context. If J 7 = (Tfr,t > 0) is a filtration, and T a stopping time relative to J 7 , 
denote by Tt the standard filtration generated by T, see for example [10], page 



The rest of the paper is organized as follows: the model of interest is described 
in Section [21 in Section [3] the main results are stated, followed by a discussion of 
some consequences, while the arguments are postponed until Section HJ 

2 Definitions and preliminaries 
2.1 The model 

For the purposes of this note, the precise definition of exchangeable coalescent 
processes (and their standard infinite version) is not essential. An interested reader 
can consult any of the references [251 121 EEE] for details of the construction, and 
further properties. 

Instead, we next construct a related "genealogical" process which, together 
with a suitable enrichment, will be in the focus of the present study. Suppose 
that H is a probability measure on A. Assume that we are given a Poisson Point 
Process on M + x A 



with intensity measure dt ® H'(dx)/ Y^Li x h where E(dx) = (1— H / (A))5(o,o,...)(^x) 
+S'(dx) and S'((0, 0, . . .)) = 0, S'(A) E (0,1] (the trivial case S'(A) = corre- 
sponds to the Kingman coalescent). 

Let n be a finite (typically large) integer. The n-genealogy (associated to S) 
evolves as follows: at the initial time 0, there are n branches present in the system, 
each having trivial length 0. As time increases, the length of each branch increases 
at unit rate. An atom (t, x) of n influences the evolution as follows: at time t let 
each branch present in the system at time t— choose an i.i.d. color from NU (0, 1), 
independently of the past evolution, according to the common law 



389. 




(2) 



CO 



Pn({i}) = x%, i > 1 and P x (du) = (1 — Xi) du, u 6 (0, 1). 



i=i 
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For each j > 1, all the branches of color j collapse immediately (at time t) into 
a single branch. In addition, each pair of branches coalesces at rate 1 — H'(A) 
independently of the above color and collapse procedure. The readers might notice 
that whenever J A E'(dx)/ x l = °°5 the time-projection of the set of atoms of 
it is dense on any interval of positive length. This could a priori cause difficulties 
in the construction, however, the above branch growing and collapsing process 
is always well-defined, as can be directly verified (or check [181 125] for a similar 
construction of exchangeable coalescents). Furthermore, it is important to note 
that the just described construction can be coupled over n G N in such a way 
that for any two m, n, where m > n, restricting the process started from m 
distinct branches onto the first n branches gives precisely the process started 
from n branches. We call this coupling the full genealogy or the full S- genealogy. 

Remark 1. Setting S(A) = 1 is convenient but arbitrary. It is easily seen that 
the time scale of any exchangeable coalescent can be multiplied by a constant 
factor, so that the law of the resulting process matches that of an exchangeable 
coalescent corresponding to a driving measure H of total mass 1. O 

One can enrich the above construction as follows: let v be a completely in- 
dependent Poisson Point Process of marks that cirri vg dbt fcitc 7 per unit length. 
This means that while there are m branches present in the system, the next mark 
(mutation) arrives at rate mj, and when it arrives it is placed uniformly at ran- 
dom (and independently from everything else) onto one of the branches. The 
parameter 7 is usually called the mutation rate. In this enriched construction, 
each branch is in one of the two states, open or closed, at any given time. Initially 
all the branches are open. A branch becomes closed starting from the moment a 
mark arrives to it, additional marks that possibly arrive afterward onto it do not 
change its state. Immediately after two or more branches collapse into one, the 
state of this newly formed branch is determined as follows: if at least one of the 
contributing branches is open, the new branch is open, otherwise it is closed. It 
is clear that one can couple this enriched genealogy again over n£N, and we call 
the resulting process the full marked genealogy. 

Let J- = (J-j, t > 0) be the filtration generated by the full marked genealogy. 
Denote by iV = N n the branch counting process, and by N° = N n '° (resp. N c = 
N n ' ,c ) the open (resp. closed) branch counting process. Note that, for each n, all 
the processes N n , N n '°, N n]C are J-"-adapted. It is clear that 

N(t) = N°(t) + N c (t), t > 0. 

Moreover, note that both iV and iV°, unlike iV c , are monotone (decreasing) pro- 
cesses. In addition we have that, almost surely, for each t > and n > 1, 

N n (t) < N n+1 (t) and N n '°(t) < N n+1 '°(t), 
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but it is not necessarily true that N n;c (t) < N n+1 ' c (t) (on Figure 1, iV 1;c (ti) > 
N 2 ' c {ti)). ^ f° r each t > the family (N n (t),n > 1) is tight, or equivalently 
if lim n N n (t) exists and is a finite random variable, we will say that the full El- 
genealogy is locally finite. This property is equivalent to the above mentioned 
coming down from infinity property for the (standard) H-coalescent. 
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Figure 1: A realization with n = 9 particles labeled by {1, . . . ,9}, and coalescent events 
occurring at times t±,t2 and t%. Open branches are indicated as thicker lines, and any 
mutation that falls onto an open (resp. closed) line is depicted as filled (resp. unfilled) 
circle. 

Call a mutation (or mark) that arrives onto an open (resp. closed) branch open 
(resp. closed). Furthermore, denote by M = M n the mutation counting process, 
and by M° = M n; ° (resp. M c = M n;c ) the open (resp. closed) mutation counting 
process. 

Clearly, 

M(t) = M°(t) + M c (t), t > 0. 

For the realization in Figure 1, N(ti) = 4 = N°(ti), while M(ti) = 5 and M°{ti) = 
4. We also have N c (ti-) = 4, while N^h) = and N c (t 2 -) = 2. 
Let 

t 11 = t? := M{t > : N n (t) = 1}, (3) 
be the time of collapse to a single lineage, and 

r r : := infO > T n : Av(t) > 0}, 

be the arrival time of the first mutation to this unique lineage. 
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In [4] it was noted that M(r") = M(r™) + 1 can be interpreted as the total num- 
ber of "families" in the corresponding infinite sites model. For the realization on 
Figure 1 we have r n = t 3 , and the "families" are {1}, {3}, {3}*, {4}, {7}, {8}, {6, 7, 
8, 9}, {6, 7, 8, 9}*, {6, 7, 8, 9}**, {1, 2, 3, 4, 5, 6, 7, 8, 9}, where the superscripts indi- 
cate the difference of the two families even though their contents, as subsets of N, 
are identical. 

Note in addition that M°(r") can similarly be interpreted as the total number 
of families in the infinite alleles model. For the realization on Figure 1, the (infinite 
alleles) families are {!}, {2, 5}, {3}, {4}, {6, 9}, {7}, {8}. 



3 Main results 

Suppose that the driving measure H satisfies 



(E~i 



\2 



Eoo 
t=l 



2 S(tfoc) < 00. (R) 



x- 



This regularity condition already appeared in [IS]. As observed in the introduc- 
tion, all A-coalescents (the Kingman coalescent included) are regular. 

Remark 2. In view of these observations, the ratio (^V X{) 2 / X^i x 1 should 
be interpreted as at (0,0,...), so that the regularity of any S-coalescent is 
determined by its "non-Kingman" part H'(dx) = E(g?x)1a\{(o,o,...)}" 

Presently, (|R|) seems necessary for the martingale analysis below to work, 
although certain modifications of the technique might lead to stronger results, 
see Remark 22 in [18]. We refer the reader to Section 3.2 in [18] for examples of 
non-regular coalescents. O 

In analogy to (j3J) define 

T b n := inf{t > : N n {t) < b}, r 6 n; ° := inf{t > : N n '°{t) < b}. (4) 

It is easy to see that (cf. [H HB] ) /* u, almost surely, where 

P(r 6 G(0,oo]) = l (5) 

(note that r& takes value oo precisely in the cases where the genealogy is not locally 
finite). Similarly, one can show via the the full (marked) genealogy coupling that 
T b '° /* T b > where 

P(7?e(0,oo]) = l. (6) 
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The fact that r^'° is non- decreasing in n is a direct consequence of the coupling. 
To see that the limit cannot take value 0, note that {N n '°(s) > b} = {r^'° > s}, 
and that for each s > 0, b G N, 

P(iV n; °(,s) > b) > F(N n (s) > b, the first b + 1 branches stay open during [0, s)), 

where the right hand side is non- decreasing in n, and where its limit p(s, b) (mono- 
tone decreasing in both variables) satisfies lim s ^ p(s, b) = 1, b > 1. 

Let a G (0,1/2) be arbitrary but fixed. The main results of this article are 
given next, starting with the most general ones. 



Theorem 1. Under (Tfl]J. there exists n G N such that for each t > and (3 G 
(0, a A 1 — 2a) 

(M n ' c (t A T n '°) 
-—-4 — njLL - | = o{e + **-*•), (7) 
T Jo AT«(u)rfM / 



lim sup P 



M n;c (t A r n, 0) 

M«(tAr„T) " 




Remark 3. Since the estimates are non-trivial only for t < 1, the optimal choice 
of a is 1/3. O 

This theorem is a consequence of the next result. 

Proposition 2. Under / Tff]) . i/iere exists no G N swc/i i/jai /or eac/i s > 



P ( sup 

te[o,s] 



1 - 



N n (t A t, 



n;o\ 
"0 J 



> 8s a = 0(s ), uniformly over n G N. (9) 



Remark 4. Note that it is not true in general that lim n _ i . 00 M n '°(t)/M n (t) = 1 for 
a fixed t > 0. For example, in the case where H is a Dirac measure 5(x,o,...) f° r some 
a; G (0, 1), all the original n branches remain in the genealogy for an exponential 
(rate 1) amount of time (regardless of n), therefore N n (t) ~ n and N n '°(t) ~ ne~ Jt 
at t ~ (uniformly in n), implying M n (t) ~ nt and M n '°(t) ~ (1 — e~ lt )/ r yn. In 
this case, lim^oo M n;0 (t)/M n >* = /(*) where lim t ^ /(*) = 1- 

All the S-coalescents without proper frequencies (cf. Mohle |20j) are regular, 
but they do not have locally finite genealogy. In [20] it is shown that the ap- 
propriately scaled total lineage length of coalescents without proper frequencies 
converges in distribution to a non-trivial random variable. O 



S 



However, if the underlying full genealogy is locally finite, one has M n '°(t) ~ 
M n (t) for a fixed time t. This result is stated next in a slightly different form, 
that may be more interesting from the perspective of applications. Define 



= MQ) ■= I ^ =ll Voo : \ H l> S(dx). (10) 



J2™i( e qXi - 1 + i x % 

Note that the above integral converges since e~ z — 1 + z < z 2 /2, z > 0, and, in 
particular, 

MQ) < 5 2 /2. (11) 

Define t i-> w n (t) G M+ by 



dq 



(12) 



£(n) := / -TT^dg. 



and let 

Theorem 3. If the full genealogy is locally finite and ( Tffj) holds, then 

M n (r n ) , M n (r n ) , M n '°(r n ) . . 

lim — — t — - = lim — — = lim = 1, zn ■probability. (13) 

n->oo M n; °(T n ) n->oo 'y ■ £(77,) n— >oo •y ■ £(n) 

Remark 5. In the A-coalescent setting (where ([Rj) automatically holds) this re- 
sult coincides with the initial part of Theorem 2 in [3]. The proof of this and 
related results in [1] is based on studying the asymptotic behavior of the arrival 
time of a (uniformly chosen at) random mutation, as n — > oo. Under additional 
assumptions on A, this asymptotics can be quite precisely determined, leading 
to the convergence almost surely results in [1] (also discussed in the paragraph 
containing (fT5T) in Section l3TTj) . The martingale-based technique presented here is 
not suitable for deducing precise information about the randomly chosen muta- 
tion. However, it is more compact than the technique from [1] (that still relies on 
martingale estimates from [3]), provides partial information even for non- locally 
finite genealogies, and is better suited for finding error bounds as well as exten- 
sions (like Theorem H] or the case of random uniformly bounded mutation rates 
as discussed in Remark [6]). O 

Define 

£ t (n):= [ v n {u)du= [ -j^dq. 



Jo Jv"(t) ip(q) 

Similar arguments as for Theorem [3] yield the following generalization: 
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Theorem 4. If the full genealogy is locally finite and / Tfl]) holds, then for any 
bounded sequence (£„)„>i of positive numbers such that £t n (n) diverges as n — >■ oo 

lim t~t = ^ m « = ^ m 7^; — 1) in probability. 

n^oo M n '°(t n ) n->oo <y ■ £ t n [n) n-5>oo 'y • £ in (n) 

3.1 Consequences, extensions, and further comments 

Theorem [3] can be restated as follows: for any regular H-coalescent that comes 
down from infinity, the number of families in the infinite alleles and the infinite 
sites model are asymptotically equal, in probability. 

If more is known about H, the convergence of Theorem [3] can be extended to 
convergence almost surely, using arguments very similar to those of Section 4.4.1 
in [3]. Furthermore, one could prove that 

N n (t) 

lim — = 1 in L p , Vp> I. 

t^o v n (t) ~ 

Theorems [3H11 could then be extended to convergence in the mean (see Remark [9] 
at the end of the article for details). 

The asymptotics (fT3ll holds (see Drmota et al. [9] and Basdevant and Gold- 
schmidt PQ) also for the Bolthausen-Sznitman coalescent, which does not have 
locally finite genealogy. The author is convinced that the arguments of Theorem [3] 
could be extended to cover this special setting (where A(dx) = E(d(x, 0, 0, . . .)) = 
dx, x e [0, 1]), and provide a shorter (and more generic) proof of this result (see 
Remark [8]) . 

For each fixed r G N, set M™(t) := ^{mutations that arrive before time t 
and affect precisely r individuals}, and M" ; °(t) := ^{open mutations that arrive 
before time t and affect precisely r individuals}, and 

M r n := M r n (r, n ), M r n; ° := M r n; °(r;). 

In the case of Kingman's coalescent, the Ewens sampling formula [12] gives the 
joint law of the above family of random variables as follows: given i — 1, . . . , n 
such that cii > and XT=i = n ' 



p {M r = ,, i = i, ...,») = 27(27+ 1) ."' (27+n _ 1) n (14) 



For a general exchangeable coalescent one can only hope for asymptotic analogues 
to (I14p . The first such approximations were given by Berestycki et al. [6j |5] for 
all the Beta-coalescents with locally finite genealogies. 
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Corollary 5. Suppose that £{n) ~* n 13 for some (3 G (0, 1), where ~* stands for 
~ up to a slowly varying multiple. 

both M™ and M™ ; ° are asymptotic to j3T(r — j3)£(n)/r\, in probability, (15) 

where due to assumptions V(z) = J* °° t z_1 e _ * dt is well defined at r — (3, r G N. 

Sketch of the proof. Note that both M r n - M"(r n ) G {0, 1}, M r n; ° - M r n; °(r n ) G 
{0, 1}, where these differences are positive for at most one r G {1, . . . ,n}. One 
can combine Theorem [3] with Theorem 1.2 in Schweinsberg [26J to conclude the 
claim (see also [1]). □ 

A stronger version of this result (in the sense of almost sure convergence) was 
obtained recently in [3], applying the main result of [13], for a class of A-coalescent 
with "/3-regular variation at 0" , that comprises the class of the above mentioned 
Beta-coalescents. However, £(n) might be asymptotic to even without the 
/3-regular variation condition. 

More generally, if £{rik) ~* {nkY for some /3 G (0, 1) along a given subsequence 
(nk)k>i, we obtain the same asymptotics for M" fe and M" fe; ° as in Corollary |5j In 
some (rather vague) sense, for each fixed r, M" fe and M" fc; ° should both be close 

to log IK ) TV r _ log/(n fe )N^/ y r | 
logn fc V logn* ^ 2_ 

One can generalize (ITSl) in a different way: given any (t n ) n >i such that £i n (n) ~* 
n' 3 , for (3 G (0, 1), one can combine Theorem H] again with Theorem 2 in [26] to 
conclude that both M™(t n ) and M" ; °(t n ) are asymptotic to /3T(r — (3)£ tn (n)/r\, in 
probability. Again, this convergence could hold only along subsequences. 

It would seem useful for applications to not only know the first order asymp- 
totics but also the error terms, and there seems to be no good general approach 
for obtaining these. However, for specific 5 (that is ip) one could get concrete 
error bounds by redoing the general calculations (in the next section) with this 
given ip. 

It is not difficult to see that the asymptotic results given here depend only on 
the behavior of measure 5 "close to (0, 0, . . .)". More precisely, if and S 2 are 
two measures on A satisfying 

S x (dx) 1{£^< £ } = 2 2 (dx) l{^. Xi <e} (16) 

for some e > 0, then the asymptotic quantities in Theorems dHU corresponding to 
Hi and H 2 will be identical. This will continue to hold even if ( IT6|) is true only in 
the ~ sense, in fact, as long as i^s t (q) ~ i>E. 2 {q) as q — > oo. 
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What might be surprising (even disturbing) is that for any probability measure 
Son A there exists a probability measure A on [0, 1] such that 

Ml)=Ml), Vg>0, 

where t/}\(q) is defined as L J^~ qx — 1+qx) A(dx)/x 2 . The question of whether the 
measures 5 and A can be connected in terms of a stochastic coupling construction 
remains open. To verify the above identity, one can check that / := ip' E is a 
completely monotone function, meaning that (—l) n ^f(q) > on (0, oo). Then 
due to Bernstein's theorem / is a Laplace transform of a positive Borel measure 
on [0, oo), and it can be written in the form 

POO 

Ml) = /(?) = a + bq+ (1 - e"**) fx(dx), (17) 

Jo 

where \i is a measure on [0, oo) such that f(l A x) fj,(dx) < oo, and /x({0}) = 0. 
Differentiating (fT0|) twice gives ^=(0) = and V'a(O) = 1, so it must be a = and 

/■oo 

1 — b+ x n(dx). 
Jo 

Moreover, ipz(q)/q 2 ->■ S((0, 0, . . .))/2, thus (PU) implies that 6 = S((0, 0, . . .)). 
Then the above identity gives S((0, 0, . . .))+ x n(dx) = 1, so A(dx) := x fj,(dx)+ 
H((0, 0, . . .))8o(dx) is a probability measure on [0, oo). By (|T7|) and the fundamen- 
tal theorem of integral calculus one now has identity 

mq) = b t + r - e_te ) ^ = 6 C + r gx + 6-91 ~ 1 m^). 

^ Jo Jo 1 Jo x 

The last quantity is identical to ipA, provided that the support of A (that is, of /i) 
is the unit interval. This last claim can be checked by using the inverse Laplace 
transform formula (see, e.g., [10], Example 5.4) together with the definition of ips,- 

4 The arguments 

The proof of Proposition |2] is based on the martingale technique from [18], that 
originated in [3] in the A-coalescent setting. More precisely, Proposition 17 in [T8] 
shows existence of n e N and C G (0, oo), such that 

E[d\og(N n (s)) \F„] = } +^)) da, (18) 
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where (h n (s), s > z) is an ^-adapted process satisfying sup sg j 2 zAT n 1 |/i n (s)| < C, 
uniformly over n, and where 

E[[dlog(N n (s))} 2 | 7" s ]l {s < r n o} < Cds, almost surely (19) 

A crucial new observation is that N n '° behaves analogously. Indeed, for x G A, 
let (Xj, j G N) be a family of i.i.d. (generalized) random variables with law P x , 
where 

oo 

P x (Xx =i)= Xi , i G N, and P x (Xx = oo) = 1 - J^a*, 

i=l 

Furthermore, for each b G N, let the family (Y^, i G N) of random variables be 
defined by 

b 

Y} b) :=Y,l{x J= i } , ieK (20) 
i=i 

Then we have, on the event {N n ' ,0 (s) = b} (see the proof of [IB], Proposition 17), 
£7(dlog(iV n;o (s)) | Fs) = [ £ x 

+ (1 - H'(A)) Q log til + bg ■ 7 6. (21) 

The first two terms in the drift above are identical to the only terms in the 
expression for the infinitesimal drift of log(iV n ) (at time s on the event {N n (s) = 
b}), leading to (TIB]) . The third term comes from the additional loss (that is, 
closure) of open branches at constant rate 7. Since 61og((6 — l)/b) = — 1 + o(l), 
one obtains 

E[d\og(N^(s)) I JF S \ = f-t^l^l + hXs)) ds } (22) 

for n ,C from (TlSI) and C Q = C + 7, and where sup ae [ 2>2Ar n^o] |/i"(s)| < C , uni- 
formly over n. Moreover, since 6 log 2 ((6- l)/6) = o(l), the bound from ([19]) holds 
with iV n; ° (resp. C ) in place of N n (resp. C). 

Recall the map v n defined in (fl2]) . The argument of [18], Section 4.1, Part I 
already yields (cf. display (40) in [IS] ) 

P sup 

\te[o,s] 



log- 



Si=l x i 



H (dx) (is 



log 



iV n (t At;, 



w n (t A r. 



re ' 
nn - 



> 2s c 



01 



A-2a\ 



(23) 



13 



but it clearly carries over to imply 



P sup 

V te[o,s] 

Since for e G (0, 1] we have | log(x)| < e iff 1—x G [0, 1 — e~ £ ] implying l—x G [0, e], 
or x — 1 G [0, e £ — 1] implying x — 1 G [0, 2e], one obtains ([9]) by combining the 
estimates in (J2"3j) and ( 124]) . and noting that < r" , Vn, n in the full genealogy 
coupling. Alternatively, the same could be concluded by applying the argument 
leading to ( I23|) -( p4|) in terms of the process log(iV n /iV n;0 ), that is, by comparing 
directly N n and N n '°. 

Remark 6. It is important to note that if the mutation rate per unit length were 
not constant but given instead as a non-negative J-"-adapted stochastic process 
(jt, t > 0) such that P(sup i>0 7 t < 7) = 1 for some 7 < 00, then (T2T]) would 
become 

E(d\og(N n >°(s))\ JF S ) = f E x 

J A 

+ (1 - H'(A)) Q log til + i og tzl . ls . 6 . (25) 

Since log ^y- ■ 7 S • b < 7, the rest of the proof of Proposition [2] would remain 
identical. Moreover, it is simple to see that the forthcoming arguments would 
easily carry over to yield appropriate analogues of Theorems (H [3] and H] in this 
general setting, where 7 J N n (u) du is being replaced by J ^ u N n {u) du and 7-^(71) 
by £(n;7) := f Q M( r y u )v n (u) du, under additional hypotheses on the process 7 
that would guarantee J* ^ u v n {u) du ~ £(n;7), as well as the divergence of the 
sequence (£(n; 7)) n >i- F° r example, for estimate (T26|) one would use the stochastic 
domination with the Poisson (mean 7 L N n;c (u) du) law, instead of the law itself. 

As mentioned in the Introduction, Proposition [2J is just an example of a fairly 
general result that could be stated as follows: if the full genealogy dynamics 
is enriched so that in the new process the branches are called either "open" or 
"closed", and the open branch count process satisfies (!25l) . then (jSJ) holds. Two 
recent specific examples from this class of results are [H] (verifying a weaker 
coming down from infinity criterion for a A-coalescent with migration model) and 
[22] (deriving the speed of coming down from infinity for the Kingman coalescent 
with constant recombination rate). The proof of this "meta-theorem" is essentially 
given in the previous paragraph. O 



log 



N n;o {tAT n, 0) 



'(iAr n n , 



> 2s c 



Ois 1 ' 201 ) 



(24) 



log 



E 



00 2 



E'(dx) ds 
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Proof of Theorem [IJ Due to the construction of the full marked genealogy 
coupling, we know that, given the path of the processes N n up to time t, M n (t) is 
a Poisson (mean 7 j^N n (u) du) random variable, that can be obtained as a sum 
of M n '°(t) and M n ' c (t). Moreover, if N n '° (that is N n ' c = N n - N n '°) is given 
in addition, then M n]C (t) is a Poisson (mean 7 J Q * N n ' c (u) du) random variable, 
conditionally independent of M n ' ,0 (t). The last observation is true if the fixed 
time t above is replaced by the random time t A r"^ , measurable with respect to 
a{N n (u), N n '°(u), u < t}. Note that M n '°(t) cannot have (conditional) Poisson 
distribution, since P(M n; °(t) < n) = 1. Note in addition that M n (t A r™ ) is a 

tf\T n 

Poisson (mean 7 J n ° N n (u)du) random variable, given JF T n . We cannot say 
the same if r" o is replaced here by r™ ;o , since knowing N n (r^ ) > n (that is, at 
least one branch is closed at time r^°) excludes the event {M n (r" ;o ) = 0} that no 
mutation arrived prior to time t^°. Due to this, some additional technical steps 
are needed below (notably, in arguing (1271) -( 1251) ). 

Due to Proposition [2J on the event A t = {N n ' c (u) < 8t a N n (u),Vu e [0, t A 
r £;°]} = {N n '°(u) > (1 - 8t a )N n (u), Vu E [0,t A r^°]} of probability greater than 
1 - Oit 1 - 2 *) we have 



1At„ 



iV n;c («) du < 8t c 



1At„ 



iV n (u) du. 



On the complement of A t we have /q A7 ™° N n ' c (u] 



,. v du < J n ° iV n (w)dw, since 
j\rn-c < jyn ag a conse q Uence Q f the full marked genealogy coupling. Hence we 
compute 



E 



M n ' c (t A r™<°) 
J * N n {u)du_ 



E 



E 



E 



M n ' c (t A r";' 



/J "° N n (u)du 
7 J ATn ° N n ' c (u) du 



< ^(8t a F(A t )+F(A c t )) 

and the estimate ((TJ readily follows. 

In order to obtain ((5]), we first note that 



((AT>),A^>)), M e[0,iA<; 



M"(tA<;°)< 



I I AT "° iV»(u) du J C jjl^f A r» ) < I ^ ^^V) **} U 
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The estimate © will now simply follow from flTJ), the Markov inequality, and 

limsup P I M n {t A r£ ) < ~ J N n (u) duj = 0, (27) 



limsup P [ M n (t A r£ ) - M n (t A r^°) > ^ / iV» du ] = 0. (28) 



tf\T n ' Q 



To obtain (1271) . we first note that 

(N n (u), u < t At")) < ^e-^o""" ^)**), 



P(M"(tA<J<- / iV»rf M 



*AT™ 



iAr 

for some c > 0, uniformly over n (and 7), and also that (J ™° N n (u) du) n >i 
diverges, almost surely, as n — > 00. Indeed, this sequence of random variables is 
monotone increasing in n, so it must be converging, almost surely, to a possibly 
generalized (i.e., taking value 00) random variable. Moreover, the divergence with 
probability 1 is clear if the full genealogy is not locally finite. Otherwise, note that 
(Jq N n (u) du) n >i diverges while (/A r „ N n {u) du) n >\ converges, almost surely, as 

n — > 00. The latter statement is clear due to (jHj). The former follows from the 
fact that lim n f n (t) = v(t) < 00, the inequality ( ITTT) . the asymptotics (1231) . and 

v n {u)du= / -yf-dq> / -dq^ - dq. (29) 
Jv"(t) W{Q) Jv n (t) Q Jv(t) Q 

The first identity above, as noted in [3], is due to the change of variables q = v n (u), 
dq = —ip{v n {u)) du. 

To obtain (J2HD, note that M n (t A r™ ) - M n (t A t%>°) is a Poisson (mean 
7 J tA N n (u) du) random variable given J>n . Moreover, in the setting of locally 

finite full genealogy, due to (El), the sequence ( f n ; ° N n (u) du) n> i is convergent 

# jk n;o 

(tight), while {fo Tn °N n {u)du) n >! is always divergent, implying (}2"8"|) . Otherwise, 
both r^ and r" o diverge to oo as n -> oo, almost surely, so M n (t At™)- M n (t A 
r nd°) = 0' except on an event of negligible probability, again implying f l28|) . 
Proof of Theorem The same change of variables as in fl29|) gives 

u n (u)du = / -f-dq = £(n), 



where 
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i ^(g) 



l V'(g)' 



The hypotheses of locally finite full genealogy and regularity imply (see |25j . 
Proposition 33 or [18], Theorem 1) that J a °° dq/ip(q) < oo (for any a > 0), and 
therefore that (t> n ) _1 (l) increases towards a finite limit 1*. As a consequence, £(n) 
diverges as n — )■ oo (see (1291)). 



Remark 7. Note that £{n) ~ J Q a v n {u) du for any fixed a > 0. O 

It suffices to prove the convergence in probability for M n (r n )/('y£(n)) and 
M n '°(T n )/('~f£(n)). We discuss the convergence of the former sequence in some 
detail below, and give at present the reasoning for the latter convergence assum- 
ing the former: since M n (r n ) = M n; °(r n ) + M n ' c (r n ), it suffices to show that 
M n ' c (T n ) = o(£(n)), in probability This will follow by (JSJ, provided M n;c (r n ) - 
M n ;c j r n; A t ) = (£(n)) for some fixed t. However, M n;c (r n ) - M" ;c (r^° At) < 
M n {r n ) — M n (r^° A t) in the full (marked) genealogy coupling, and M n (r n ) — 
M n (r^°At) is (as in the proof of Proposition [2]) a conditional Poisson random vari- 

able with mean 7 ■ / „ ;OA , N n (u) du. So the above claim follows due to convergence 

(tightness) of J\-,o At N n (u) du and divergence of £(n). 

The rest of the argument is analogous to that for Theorem 5 in [3] . The bulk 
of it is showing that JJ" N n (u) du ~ £(n), in probability, as n — )• 00. Since, given 

f'T n 

N n , M n (r n ) is a Poisson random variable with mean J N n (u) du, and since £(rz) 
diverges with n, one concludes (as a special case of LLN) that M n (r n ) ~ / y£{n), in 
probability. It therefore suffices to show that for any subsequence there exists 

a further subsequence such that J N nk j (u) du/^n^) — > 1, almost surely. 

To simplify the notation, we rename the subsequence (nk)k>i as (k)k>i- Recall 
ff23|) . and for a fixed a G (0, 1/2) choose a decreasing sequence (sk)k>i of positive 
numbers, such that 



E 



4" 2Q < 00. 



Then we can conclude from (151), ([23]) and the Borel-Cantelli lemma that 



lim sup 

fc-Kx) te[o,s k 



N k (t) 



v k (t) 



0, almost surely. (30) 



Due to the assumption of locally finite full genealogy, we can now choose a sub- 
sequence kj such that v kj {u) du diverges as j — > 00, and that also both 



v k > (u) du j I' v k > (u) du and / N kj (u) du / I ' N k > (u) du (31) 

>i JO J Si JO 
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tend to as j — » oo (for the second sequence, the limit is taken almost surely). 
Joint with ( )30|) . this ensures that 

N k > (u) du J° J N k > (u) du 

lim = lim — = 1, almost surely. 

Jo vkj ( u )du J J v k j{u)du 

Remark 8. If H(<ix) = l{ x =(a;,o,o,...)} for x G (0, 1), or equivalently, in case of the 
Bolthausen-Sznitman coalescent we have 

ip(q) = q log q + O(q), as q -)■ oo, 

with 0(q) > 0, for all q > 0. Therefore 

f ra (t) G [n e * (1+o(1>> ,n e '] as well as £(n) ~ , as n — > oo. 

log n 

We conclude that v n (t) is of order 1 at times of order loglog(n) and in turn that 

<>log log n 

v n {t) dt = o{£{n)), as n -> oo. (32) 

In order to show that J N n {u) du ~ ^(^), it therefore suffices to start as in the 
paragraph which comprises fl30l) - fl3Tl) . ensuring the following analogue of fl3Tl) 



^ (it) du j I ( u ) du^O and / N k > (u) du / N k i (u) du 0, 

JO J Sj JO 

and to verify in addition that (possibly along a further subsequence) 

\ N k i (t) dt = o(£(kj)), as j oo. (33) 

This can all be done due to (1231) . the fact that v n (t) is bounded by a power of n 
smaller than 1 for each fixed t, asymptotically in n, and finally the estimate 

Er n = O (log log n), 

which can be obtained via optional stopping of the martingale M t := f^n^ — t, 
t > 0, where x/)(q) := L ^((1 — x) q — 1 + qx)/x 2 = ip(q)+0(l) (see [19] for further 
use of M and its generalizations). O 
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Proof of Theorem^ Due to the assumption that £ t „ (n) diverges, one can argue 
as in the proof of Theorem [3] that it suffices to show that 

M n (t ) 

" ->■ 1, in probability. (34) 



The argument for (134]) is analogous to the last one. In fact, with the same 
choice of the sequence (sk)k>i as above, one now chooses a subsequence kj so 
that Jq° A v k] {u) du diverges as j — > oo, and in addition both 



(u) du l{ tj>Sj } I J v 3 (u) du and J (u) du l { t ]>Sj } / / N fcj (u) du 
tend to as j — > oo. Joint with f l30|) this ensures that 
lim 



f* J N k * (u) du it 3 ASj N k > (u) du 

\ — — = lim — — = 1, 

jV vkj ( u ) du f Q 3 3 v k i{u)du 



almost surely. Finally, AM]) follows by another application of LLN. 

Remark 9. The arguments of [3], Section 4.3 apply in the regular setting. Indeed, 
instead of [3], Lemma 20 one now has the following statement: There exists uq G N 
and Kq < oo such that for all b > n , x 6 A H {x : — 1/4}, c > 0, and 

given by fl20l we have 

oo 

^x[exp{c[log(6-^(F/ b) -l {y(b)>0} ))-log6] 2 }-l]<e 9c / 4 K [(^x, 2 ) + (^x,) 2 ]. 



e=i 



(35) 

Since £7 x\ < a^) 2 , the RHS above could be simply bounded by 2Ko(^2 i x «) 2 > 
however, in doing so one might lose some information of the impact of regularity 
(or irregularity). Due to (13"5]) . in the definition of the process and the related 
calculations leading to (33)-(34) in [3J, the constant K should be replaced by 



k = k (i+ J(jr 



2 S(dx) 



In particular, one can conclude that for any s > and d > 1 



supE sup 

">1 \*e[o,s] 



N n (t) 



v n (t) 



D(d,s)< 00, (36) 
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and moreover that lim^o D(d, s) = 0. As indicated earlier, we then have N n (t)/v n (t) — > 
1, as t — > in L d , for each d > 1. 

As a consequence of the previous observations, and arguments very similar to 
those for Theorem (TJ it is not difficult to check that, for each fixed t > 0, 

E(M"(t)) _E(f*N"(u)du) _^ 



as well as 



7 • £t(n) j Q v n (u) du 

E(M n ' c (t)) _ E(f* N n ' c (u) du) 
1-Zt{n) f*v n (u)du 



10 

implying (1371) with M n; ° in place of M n . Under the assumption of locally finite 
genealogies, d\{n) ~ £(n) and 



E[M n (r™) — M n (l)] = £ 



7 / iV n (w) 



< 7 £[iV n (l) • (r™ - 1)] < C, Vn > 1, 



where the final uniform estimate is due to sup n ^(A^l)) < oo, and sup ra E{r™ — 
l|7V n (l)) < ^(sup^r™) < oo. Hence, for a locally finite H-genealogy, both 
M n (r?))/(-f£(n)) and M n '°(r^) / {^£{n)) converge to 1 in the mean. O 
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